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Abstract 

We consider the robustness of computational hardness of problems whose input is obtained 
by applying independent random deletions to worst-case instances. For some classical MV- 
hard problems on graphs, such as Coloring, Vertex-Cover, and Hamiltonicity, we examine the 
complexity of these problems when edges (or vertices) of an arbitrary graph are deleted inde¬ 
pendently with probability 1 — p > 0. We prove that for n -vertex graphs, these problems remain 
as hard as in the worst-case, as long as p > ^rW for arbitrary e £ (0,1), unless MV C BVV. 

We also prove hardness results for Constraint Satisfaction Problems, where random deletions 
are applied to clauses or variables, as well as the Subset-Sum problem, where items of a given 
instance are deleted at random. 
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1 Introduction 


The theory of ATP-hardness suggests that we are unlikely to find optimal or near optimal solutions 
to ATP-hard problems in polynomial time. This theory applies to worst-case settings where one 
considers the worst running-time over the worst possible input. It is less clear whether these 
hardness results apply to “real-life” instances. One way to address this question is to examine 
to what extent known ATP-hardness results are stable under random perturbations, as it seems 
reasonable to assume that a given instance of a problem may be subjected to noise originating from 
multiple sources. 

In this work we study worst-case instances that are subjected to random perturbations of a 
specific type, namely, random deletions. We focus on the following deletion process, known as edge 
percolation : given a graph G consider a random subgraph of G obtained by deleting each edge of G 
independently with probability 1 — p (where p € (0,1) may depend on the size of the instance). This 
model generalizes familiar random graph models such as the Erdos-Renyi random graph G(n,p). 
Instead of focusing on deleting edges at random from the complete graph, our starting graph G may 
be chosen arbitrarily out of all n-vertex graphs. Then the edges of G are deleted independently 
with probability 1 — p. The case of vertex percolation, where the vertices of a given graph are 
deleted independently at random is examined as well. We also study random deletions in other 
A/'T’-complete problems, such as 3-SAT and Subset-Sum. 

Throughout we refer to instances that are subjected to random deletions as percolated instances. 
Our main question is whether such percolated instances remain hard to solve by polynomial-time 
algorithms, under reasonable assumptions from complexity theory. 

A first example. Consider the 3-Coloring Problem, where given a graph G we need to decide 
whether G is 3-colorable. Suppose we sample a random subgraph G' of G , by deleting each edge 
of G independently with probability 1/2, and ask whether the resulting graph is 3-colorable (one 
can prove similar results when edges are deleted with probability smaller than 1/2, but we focus 
on the case where p = 1/2 for concreteness). Is there a polynomial time algorithm can decide with 
high probability whether G' is 3-colorable? Or does the problem remain hard in the sense that an 
efficient algorithm that determines whether G' is 3-colorable would imply that every problem in 
MV admits an efficient algorithm? 

We demonstrate that a polynomial-time algorithm that decides whether G' is 3-colorable is 
unlikely. We show it by considering the following polynomial time reduction from the 3-Coloring 
Problem to itself. Given an n-vertex graph H the reduction outputs a graph G that is an R-blow-up 
of H for R = Cy / log(n) where C > 0 is large enough. That is, each vertex of H is replaced by 
a cloud of R vertices that form an independent set in G, and each edge of H is replaces with a 
complete R x R bipartite graph in G between the corresponding clouds in G. It is clear that H is 
3-colorable if and only if G is 3-colorable. 

In fact, the foregoing reduction satisfies a stronger robustness property for the random subgraph 
G' of G. Namely, if H is 3-colorable, then G is 3-colorable, and hence G' is also 3-colorable with 
probability 1. On the other hand, if H is not 3-colorable, then G is not 3-colorable, and with high 
probability G' is not 3-colorable either. Indeed, for any edge (y\,V 2 ) in H let U\, U 2 be two clouds 
in G corresponding to v\ and v-i- Fixing two arbitrary sets U[ C U\ and U' 2 C U 2 each of size at 
least R/ 3, the probability there is no edge connecting a vertex from U\ to a vertex in U 2 is at most 
2 —R 2 /9 _ 2 ~ c ' l °s n _ By union bound we get that with high probability (over the sampling of G') 
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for any two clouds U\,U 2 corresponding to an edge in H and any U[ C U\ and U 2 C U 2 each of size 
at least R/3 there is at least one edge between U[ and U 2 . Therefore, with high probability any 
3-coloring of G' can be decoded to a 3-coloring of H by coloring each vertex v of H with the color 
that appears the largest number of times in the coloring of the corresponding cloud in G' (breaking 
ties arbitrarily). This suggests that unless A fV C co 1ZV there is no polynomial time algorithm 
that given a 3-colorable graph G finds a legal 3-coloring of a random subgraph of G obtained by 
subsampling every edge with probability 1/2. 

1.1 Motivation 

There is a large body of research dealing with computational problems on random graphs and 
formulas [19]. This study has resulted with several algorithms which have proven effective on 
random instances. A more recent line of research suggests that efficient algorithms for finding exact 
or approximate solutions to computational problems on random objects may not exist [1, 11, 31]. 
Other works have demonstrated that assuming problems on randomly generated instances to be 
hard, implies hardness of approximation results for certain optimization problems that are not 
known to follow from worst-case assumptions [15]. These results raise the question of what kind of 
hardness results for solving optimization problems exactly or approximately for percolated instances 
can be derived when the original instance is selected in a worst-case fashion. We note that proving 
hardness results for our model should be an easier task than proving hardness results for random 
instances such as those arising, for example, from the Erdos-Renyi random graph G(n,p), as we 
have more freedom in choosing the instance that is subjected to random deletions. 

The study of random discrete structures has resulted with a wide range of mathematical tools 
which have proven instrumental in proving rigorous results regarding such structures [9, 19, 21, 
29]. Our hybrid model may offer the opportunity to apply these methods to a broader range of 
distributions of instances of MV- hard problems. 

1.2 Our results 

We consider several classical AAR-hard problems, for which we prove that they remain hard also on 
percolated instance. Unless stated otherwise, n stands for the number of vertices in the graph. 

• For the Maximum Independent Set problem we use the hardness of approximation result 
of [16] to show that for edge percolation, where we keep each edge of a given graph with 
probability p > ^4^ for some e € (0,1) it is hard to approximate the maximal independent 
set on percolated instances within any factor better than U ( pri \- e ) We also show that the 
chromatic number of a percolated instance in hard to approximate within 0(pn 1 ~ £ ). Note 
that for p > (in fact, for p > C lo ^ ^ ) such random percolated graphs have maximal 

degree at most 0(pn) with high probability, and hence can be colored efficiently using 0(pn) 
colors. 

We also prove that for vertex deletion these problems remain as hard to approximate as in 
the worst-case, as long as the vertices remain in the graph independently with probability 
p > ^4^ for some e € (0,1). More specifically, denoting by m the number of remaining 
vertices in the vertex percolated subgraph, it is hard to approximate its chromatic number or 
independence number within a factor of m 1 ”* 5 (resp. Is ) for arbitrary constant <5 € (0,1). 
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• For the Vertex-Cover problem, we prove that for any constant 5 > 0 an algorithm that gives 
2 — 5 approximation for percolated instances implies also a 2 — 2 5 approximation algorithm 
for worst-case instances. Our results hold for both edge and vertex percolation, where the 
edges or the vertices of a given graph remain with probability p > for some £ € (0,1). 
In particular, assuming the Unique Games Conjecture, the results of [25] imply there is no 
randomized polynomial time algorithm that with high probability gives 2 — 5 approximation 
for the Vertex Cover problem on percolated instances. 

• For the Hamiltonicity problem, we prove hardness results for percolated instances with respect 

to edge percolation on directed graphs. We show that the problem where one needs to 
determine whether a graph contains a Hamiltonian cycle is also hard for percolated graphs, 
where each edge of a given graph is kept in the graph with probability p > - ) for any 

£ € ( 0 , 1 ). 

• We also consider percolation of 3-SAT instances where clauses are deleted at random with 

probability p. We prove that, unless MV C coTZV for every e, 5 € (0,1) if clauses of a 
given 3-SAT survive with probability p > then (7/8 + ^-approximation on percolated 

instances is hard, as it is the case for worst-case instances. This result is nearly tight, as known 
algorithms for random 3-SAT formulas imply that for sufficiently small c > 0 if clauses survive 
deletions with probability p > ^, the resulting formula admits a satisfying assignment which 
can be found efficiently with high probability (see the related works section for more details). 

More generally, we prove that unless A fV C BW arbitrary k-aiy Boolean CSP problems are 
as hard to approximate on percolated instances as in the worst-case, as long as each clause is 
percolated with probability p > nfc _ 1 1 _ e for any e € (0,1). 

The key step in the proof is establishing that any hardness of approximation of a k -ary CSP 
problems can be translated to the same hardness approximation on instances whose number of 
constraints is n k ~ v for arbitrary small rj > 0. For example, relying on the result of Hastad [22] 
we show that 3-SAT is ATP-hard to approximate with a ratio better than 7/8 + e even on 
instances that contain at least n 3 ~ v clauses. 

We also consider variable percolation, where each variable is deleted with probability p > 
for any £ € (0,1) (when a variable is removed all clauses containing it are removed as well). 
Similar ideas as those applied for the clause percolation case imply that such percolated 
instance are essentially as hard as in the worst case. 

• We study percolation on instances of the Subset-Sum problem, where each item of the set 
is deleted with probability 1 — p. We show that the problem remains hard as long as p = 
fl( i/ 2 _ e ) for some £ € (0,1/2), where n is the number of items in the given instance. 

1.3 Our techniques 

In proving hardness results for percolated instances we use the concept of robust reductions which 
we explain next. It will be convenient to consider promise problems. Recall, that a promise problem 
is a generalization of a decision problem, where for the problem L there are two disjoint subsets 
Lyes and Leo, such that an algorithm that solves L must accept all the inputs in Lyes and reject 
all inputs in Leo- If the input does not belong to Lyes U Leo-, there is no requirement on the 
output of the algorithm. 
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Definition 1.1. For each y G {0,1}* let Perc(y ) be a distribution on {0,1}* ; that is samplable in 
time that is polynomial in \y\. 

For two promise problems A = (Ayes, Ano) and B = (Byes, Bno) a polynomial time reduc¬ 
tion r from A to B is said to be Perc-robust if 

1. For all x G Ayes it holds that r(x ) € Byes> and Pr [Perc(r(x)) G Byes\ > 1 — o(l). 

2. For all x G Ano it holds that r(x) G Bno, and Pr [Perc(r(x)) G Bno] > 1 — o(l). 

If in the first item we have Pr [Perc(r(x)) G Byes\ = 1, then we say that r is a Perc-robust 
colZV-reduction. Similarly, if in the second item we have Pr [Perc(r(x)) G Bno] = 1, then we say 
that r is a Perc-robust IZV-reduction. 

Let us elaborate on how robust reductions apply when graph percolation is concerned. Let A, B 
be MV languages over graphs, and given a graph y , let Perc(y ) be vertex or edge percolation of 
y. In such setting a reduction r is said to be Perc-robust if it satisfies the standard definition of a 
reduction, i.e., x € A if and only if r(x) € P, and in addition the containment of r(x) in Byes (or 
in the Bno ) is robust to random deletions that are captured by the distribution Perc(r(x )). As 
a concrete example, consider the language 3-SAT consisting of all satisfiable 3-CNF formulas, and 
the language HamCycle consisting of graphs containing a Hamiltonian cycle. Consider a reduction 
from 3-SAT to HamCycle that given a 3-CNF formula <f produces a graph G. Let G p be a random 
subgraph of G obtained from G by including each edge of G independently with probability p. The 
reduction is said to be robust with respect to edge percolation if the following two assertions hold: 
(1) if <f> is satisfiable, then G contains a Hamiltonian cycle and G p contains a Hamiltonian cycle 
with high probability, and (2) if 0 is not satisfiable, then G is not Hamiltonian, and with high 
probability G p is not Hamiltonian either. 

We make two remarks regarding the example above. First note that if G = ( V , E ) does not 
contain a Hamiltonian cycle, then neither does any graph G' = (V,E') where E' C E. Therefore, 
if such robust reduction exists, then it is necessarily a robust PP-reduction. 

Note also that such reduction must be such that if <f> is satisfiable, then G contains a Hamiltonian 
cycle, and furthermore G must contain many Hamiltonian cycles, even if cf) has only a small (e.g., 
constant number of satisfying assignments. Indeed, if G contained only K Hamiltonian cycles for 
some constant K, then, G p is unlikely to be Hamiltonian, as typically only pn edges (n is the 
number of vertices of G) of each cycle will remain after percolating the edges. That is, such a 
reduction cannot be a parsimonious reduction in the sense that the reduction preserves the number 
of A/"P-witnesses. 

The existence of such a reduction implies that the Hamiltonicity problem is in some sense MV- 
hard on percolated instances. Below we explain this hardness more precisely. We start with the 
following definition. 

Definition 1.2. Let L = (Lyes, Lno) be a promise problem, and for each y instance of L, let 
Perc(y ) be a distribution on instances of L that is samplable in time that is polynomial in |y|. 

The problem L = (Lyes, Lno) is said to be A/P-hard under a Perc-robust reduction if there 
exists a Perc-robust reduction from an MV-hard problem to L. 

We use the term Perc-robust to avoid confusion with other notions of robust reductions that 
have appeared in the literature. In order to ease readability, we will often write robust reductions 
instead, always refereing to perc- robust reductions as defined above. 
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Proposition 1.3. Let L = (Lyes, A/vo) be a promise problem, and for each y instance of L, let 
Perc(y) be a distribution on instances of L that is samplable in time that is polynomial in \y\. 

If L is AfV-hard under a Perc-robust reduction, then there is no polynomial time algorithm that 
when given an input y decides with high probability whether Perc(y) € B, unless AfV C BVV. 

If the foregoing hardness holds under a Perc-robust IZV-reduction (coIZV-reduction), then the 
same conclusion holds, unless AfV C IZV (resp. AlV C coIZV). 

For coTT-R-reduction we have the following search decision version of the foregoing proposition. 

Proposition 1.4. Let L = (Lyes, Lno) be a promise problem, and for each y instance of L let 
Perc(y ) be a distribution on instances of L that is samplable in time that is polynomial in \y\. 

If L is AfV-hard under a Perc-robust reduction, then, there is no polynomial time algorithm 
that when given an input y with high probability finds a witness for the assertion Perc(y) € B, 
unless AfV C coIZV. 

An example of an application of Proposition 1.4, consider the 3-SAT problem. Recall that by a 
result of Hastad [22] given a satishable 3-SAT instance 4> it is ATP-hard to find an assignment that 
satisfies significantly more than 7/8 fraction of the constraints of 4>. A stronger conclusion follows 
from Theorem 4.3. Namely, given a satishable 3-SAT instance 4> it is hard to find an assignment 
that satisfies significantly more than 7/8 fraction of the constraints in a random subformula of 4>, 
obtained from 4* be deleting each clause with probability, say, p = 1/2 (while, any assignment that 
satisfies 4>, also satisfies every subformula of 4>). 

To construct robust reductions we use two methods. One is to apply hardness of approximation 
results implied by the PCP Theorem [7, 6]. Intuitively, the gap between YES-case and NO-case in 
such hardness results, makes it possible to prove that percolated instances remain hard as random 
deletions will not affect the optimum by much, keeping (with high probability) the distinction 
between the YES-case and the NO-case. 

When known hardness results do not suffice (as it is the case for the Vertex Cover problem), or 
when hardness of approximation results are unlikely, (as is the case for the Subset-Sum problem 
which admits a PTAS) we “blowup” the instance in a certain way and prove that this blowup 
preserves certain combinatorial properties even when edges (or vertices) are deleted with high 
probability. The most standard blowup technique is to replace, given a graph G, every vertex 
of G with a large independent set and connect two independent sets that correspond to adjacent 
vertices of G by a complete bipartite graph. This method has been previously used to prove the 
ATP-hardness of Feedback Arc Set on tournaments [2] . Other variants of blowup are used for the 
Hamiltonian cycle problem and Subset-Sum. 

1.4 Related Works 

Randomly subsampling subgraphs by including each edge independently in the sample with prob¬ 
ability p has been studied extensively in works concerned with cuts and flows (e.g., [23]). More 
recently, sampling subgraphs has been used to find independent sets [17] (the main sampling tech¬ 
nique used, e.g., the layers model is not independent-it introduces dependencies between sampled 
vertices). The effect of subsampling variables in mathematical relaxations of constraint satisfaction 
problems on the value of these relaxations was studied in [8]. Edge-percolated graphs have been 
also used to construct hard-instance for specific algorithms. For example, [27] proved that the well 
known greedy coloring algorithm performs poorly on the complete r-partite graph in which every 
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edge is removed independently with probability 1/2 and r = n e for e > 0. Namely, for this graph 
G , even if vertices are considered in a random order by the greedy algorithm, with high probability 
n(nss) colors are used to color the percolated graph whereas x(G) < n £ . 

The work of [20] examined the problem of finding a maximum independent set in regular 
graphs where the weights of the vertices are i.i.d. exponential random variables. In this work the 
authors prove that for 3-regular graphs, it is the case that for every e the problem admits a (1 — e) 
approximation in time n k g(e) where k is independent of n and g(e) depends only on e. They also 
prove that for large enough (constant) degree A, the problem of approximating the expected size 
of a maximum independent set in such randomly weighted graphs is essentially as hard as solving 
MIS on graphs with maximal degree A. Our hardness results are based on different ideas than 
those of [20]. 

The field of stochastic optimization is concerned with solving computational problems where 
elements of the instance (e.g., weights, the existence of edges) are random variables. Typically, 
the main focus in this line of works is to design algorithms that make decisions (at least for part 
of the input) before the random variables have been instantiated, with good expected guarantees 
(e.g., [13, 26]). Our work is exclusively concerned with fully instantiated problems. In addition, we 
focus on a very specific type of uncertainty, where every random variable is either zero or one. As 
a result, the sampled objects admit a straightforward combinatorial interpretation (e.g., randomly 
sampled subgraphs or formulas) that is lacking when considering random variables such as the 
exponential distribution. In addition, dealing with a restricted family of random variables makes it 
more challenging to prove hardness results regarding instances with edges or vertices whose weights 
are distributed as these random variables. 

When p is sufficiently small, algorithms for random graphs and random formulas can be proven 
to find the optimal solution (with high probability) for percolated instance. For example, for graph 
coloring, it is known that for p = -hA with some positive constant e > 0, with high probability 
G(n,p) is 2-degenerate, and hence can be 3-colored in polynomial time [28]. Since the property of 
being 2-degenerate is monotone, and as 2-colorability can be decided in polynomial time, it follows 
that for every n- vertex graph and p < one can find in polynomial time with high probability 
a coloring of the edge percolated graph with the minimum number of colors. 

Similar reasoning applies to 3-SAT formulas. It is well known that there exists c > 0 such that 
a random 3-SAT formula in which each possible clause is added independently with probability 
p = -% can be solved with high probability using the pure literal heuristic [10]. As observed 
in [10], if this heuristic fails in finding a satisfying assignment for a formula 4> it will still fail to 
find a satisfying assignment if clauses are added to 4>. This implies that for any n-variable 3-SAT 
formula d>, if p < then the clause-percolated formula is satishable and furthermore a satisfying 
assignment can be found in polynomial time using the pure literal heuristic. 

1.5 Preliminaries 

In this work, we will only consider simple graphs without multiple edges and self loops. When 
directed graphs are concerned we allow the two directed edges (u , v ) and (v, u ) to coexist-such a 
situation is not considered as having multiple edges. 

Given a graph G = (V,E) (that may be directed or undirected) and p G (0,1), we denote by 
G>,e = (V, E ') the probability space of graphs on the same set of vertices, where each edge e € E 
is contained in E' independently with probability p. We will say that G Pte is obtained by edge 
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percolation. We define G P:V = (V',E') as the probability space of graphs, in which every vertex 
v € V is contained in V' independently with probability p. and E' is the subgraph of G induced 
by the vertices V 1 . We will sometime say that G P}V is obtained from G by vertex percolation. 
When dealing the running time on percolated instances we will always measure running time in 
terms of the size of the percolated instance. For edge percolation, it makes little difference as far 
as polynomial-time algorithms are concerned, as the percolated and original graphs have the same 
number of vertices. For vertex percolation, this is not the case, since for tiny values of p the size 
of the percolated graph will be typically much smaller than the size of original graph. In this work 
we will be only dealing with the case where p = , hence with high probability the size of the 

percolated and the original graphs are polynomially related as well. 

Given a graph property V and a sequence of probability distributions (p n )n over n-vertex 
graphs, we will say that V holds with high probability if lin^^oo Pr€ V\ = 1. 

We say that an algorithm approximates a maximization problem within a ratio of 0 < a < 1 
(where a to depend on the size of the instance) if it returns a feasible solution that is at least 
a ■ OPT , where OPT is the value of the optimal solution. Similarly, we say that an algorithm 
approximates a minimization problem within a ratio of b > 1, if it returns a feasible solution that 
is at most b ■ OPT where OPT is the value of the optimal solution. 

We shall rely on the following version of the Chernoff bound (see, e.g., [32]). 

Theorem 1.5 (Multiplicative Chernoff bound). Let X \,...,X n be independent 0-1 random, vari¬ 
ables with Pr [Xi = 1] = p. Then, 

n 

p r[| Xi — pn\ > epn] < e Ce2 P n ; 

i =1 

for some absolute constant C > 0. 

Corollary 1.6. Let ..., Xn\ ■ ■ ■, x[ rn \ ..., Xn"' > be independent 0-1 random variables with 
Pr[Xp) = 1] = p. Then, for some absolute constant C > 0 it holds that 


n 

Pr[3j e [m] : | X^ — pn\ > \JCpn log(m)] < m~ 3 . 
i =1 

Proof. By the multiplicative Chernoff bound above for each j € [m] it holds that Pr[| — 

pn\ > y 7 Cpn log(m)] < < m -4 , where C > 0 is some absolute constant. Therefore, 


Pr[3j € [m] : | 

i =1 


pn | > y 7 Cpn log(m)] 


1 — Pr[Vj € [m] 


< 1 — (1 — m _4 ) m 

< m^ 3 , 


n 

— pn\ < yjCpn log(m)] 

i =1 


as required. 


□ 


2 Graph Coloring, Independent Set and Percolation 

An independent set in a graph G = ( V, E) is a set of vertices that spans no edge. The independence 
number of a graph is the size of an independent set of maximum size. Given a graph G , we denote 







the independence number of G by a(G). A legal coloring of a graph G is an assignment of colors to 
vertices that no two adjacent vertices have the same color. The chromatic number of G, denoted by 
x(G) is the minimum number of colors that allows a legal coloring of G. Clearly, y(G') • a(G ) > n. 

In this section we demonstrate the hardness of approximating a(G) and y(G') in percolated 
graphs for both edge and vertex percolation. We start with edge percolation. 


Edge percolation Here a crucial observation is that a graph without large independent sets 
cannot contain large sets that span a too small number of edges. We need first the following 
lemma, due to Turan (see, e.g. [3]). 

Lemma 2.1. Every graph H with l vertices and e edges contains an independent set of size at least 
i 2 

2 e+l ' 

Corollary 2.2. Let G = (V,E) be an n-vertex graph satisfying a(G) < k. Then every set of 
vertices of size l > k spans at least l(l — k)/2k edges. 

Proof. Let H be a subgraph of G induced by l vertices, and suppose that H spans e edges. Then, 
by Lemma 2.1 we have a(H) > 2^+1 ■ On the other hand, a(H) < a(G) < k, and hence < k, 
as required. □ 

Lemma 2.3. Let G = (V,E) be an n-vertex graph. Then, with high probability a(G Pte ) < 
0 (^ lo §(«p))- 

Proof. Let k = a(G) + 1. Let C > 0 be a large enough constant. By the corollary above, every 
set of size l = C a ^ log(np), spans at least l ^ 2 ^ edges. Hence, by taking union bound over all 
subsets of size l, the probability there exists a set of size l in G p that spans no edge is at most 

(") • a -p)^ < (t)‘ ■ ex p < w"® 


where the last inequality uses the choices of l and k, implying that (^f) 1 < ( np) l and exp(— p l ^ 2 ^ ) 
exp(— VL(l ■ log(np))) = (np)~ n ^. 


< 

□ 


We observe that in general, the upper bound above cannot be improved, as it is well known 
that the independence number of G{n,p ) is O ^ lo g(”P) ^ with high probability (see, e.g., [9]). 

We are now ready to prove that it is hard to approximate the independence number and the 
chromatic number on edge percolated graphs. For this we consider the following gap problem which 
we call Gap-Coloring(y, a), where the YES-instances are all graphs G with y(G) < x an d the NO- 
instances are all graphs G with a(G) < a. (We assume that for n-vertex graphs G the parameters 
of the problem are such that a(G ) • x(G') < n, so that the YES-instances and the NO-instances are 
disjoint sets.) 

Theorem 2.4. Let e € (0,1) be a fixed constant. Let p > n i\ E , and let y = n £ and a = where 
n denotes the number of vertices in a graph. Then, the Gap-Coloring(x, oh) problem is NV-hard 
under a robust reduction with respect to edge percolation with parameter p. 

In particular, unless MV C BVP there is no polynomial time algorithm that approximates either 
a(G Pte ) or x{G Pt e) within a factor pn Lz f (resp. pn l ~ 2e ). 
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Proof. By a result of Feige and Kilian [16], for any e > 0 it is VP-hard to decide whether a given 
n-vertex graph G = (V. E) satisfies x(G) < n £ or a(G) < n £ / 2 . Let G = G P}e be the p-edge 
percolated subgraph of G. Next we claim the following. 

YES-case: If \'(G) < n £ , then x(G) < n £ . 

NO-case: If a(G) < n £ / 2 , then a(G) < V. 

The YES-case is clear, since G is obtained from G by removing edges which can only decrease 
the chromatic number. 

For the NO-case suppose that a(G ) < n £ / 2 . Then, by Corollary 2.3 it follows that with high 
probability a(G p>e ) < O ^^4/4 log(np)^ < as required. 

The “in particular” part of the theorem follows from the fact that an n-vertex graph G it holds 
that x(G) ■ a(G) > n. □ 

Remark. Note that for constant p > 0 (e.g., p = 1/2) this theorem establishes an inapproximability 
for the independence number of G Pje , that matches the inapproximability for the worst case. 

Remark. Note also that for p > ^4^- (in fact, for p > ) such random percolated graphs 

have maximal degree at most 0(pn) with high probability. Therefore, such graphs G can be colored 
efficiently using 0(pn) colors. In particular, with high probability G contains an independent set of 
size 0(1 /p) and hence, a(G ) can be approximated within a factor ofl/pn on p-percolated instances. 

Vertex percolation We now move on to deal with vertex percolation. We show that approxi¬ 
mating the a(G) and x(G) on percolated instances is essentially as hard as worst-case instances, 
even if vertices remain with probability -^4^-, where n is the number of vertices in the graph for 
any e € (0,1). We do it again by proving hardness of the gap problem Gap-Coloring for percolated 
instances. 

Note that in the case of vertex percolation, we (in)approximablity guarantee should depend on 
the number of vertices in the percolated graph G PtV , and not in the original graph. 

Theorem 2.5. Let s,5 € (0,1) be fixed constants. Then, for any p > ^4^- the Gap-Coloring{xi &) 
problem is MV-hard under a robust reduction with respect to vertex percolation with parameter p, 
where x = and a = m £//2 , with m denoting the number of vertices in the vertex percolated 

graph. 

In particular, unless NV C BVV there is no polynomial time algorithm that approximates either 
oc(G p>v ) or x(G p . v ) within a factor m 1_£ for constant any e > 0. 

Proof. For a given p > ^4^- let c = £ (5,1) be such that p = -yr^, and let r/ = e • c/3. By a 

result of Feige and Kilian [16], it is VP-hard to decide whether a given n-vertex graph G = (V, E ) 
satisfies x(G) < n v or a(G) < n v . 

Let G = G PiV be the p-vertex percolated subgraph of G, and let m be the number of vertices 
in G. By concentration bounds, we have \m — pn\ < 0.1 pn with high probability, and we shall 
assume from now on that this is indeed the case. By the choice of the parameters this implies 
n v < ?n £ / 2 Therefore, if x(G) < n v , then x(G) < n 71 < m £ . On the other hand, if a(G) < n^, then 
a(G) < n* 7 < m £ , and the proof follows. □ 
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2.1 Vertex Cover 


An vertex cover in a graph G = {V, E) is a set of vertices S C. V such that every edge e € E is 
incident to at least one vertex in S. that spans no edge. In the Minimum Vertex Cover problem 
we are given a graph G and our goal is to find a vertex cover of G of minimal size. 

Note that for an n-vertex graph G it holds that G contains a vertex cover of size k if and only 
if it contains an independent set of size n — k. 

There is a simple factor 2 approximation algorithm for the Minimum Vertex Cover problem [32]. 
On the hardness side, the problem is TVU-hard to approximate within a factor of 1.3606 [14], and 
assuming the Unique Games Conjecture is known to be WU-hard to approximate within a factor 
of (2 — e) for any constant e > 0 [25]. We prove that the same hardness result hold also when 
instead of worst-case instances one considers 
We will need the following definition. 

Definition 2.6. Given a graph G, the R-blowup of G is a graph G' = (V',E'), where every vertex 
v is replaced by an independent setv of size R, which we also call the cloud corresponding to v. If 
(u,v) G E, then u and v are connected by a complete R x R bipartite graph. 

Edge percolation We have the following simple lemma regarding independent sets in edge per¬ 
colated subgraph of Krr. 

Lemma 2.7. Consider the complete bipartite graph Kr^r with bipartition A,B, and let G p>e be the 
edge percolation of Kr^r with probability p. Then, the probability that there is an independent set I 
in Gp^ e such that |InA| = \IUiB\ = C\og{R)/p is at most R~ 3 , where C is a large enough constant 
independent of n or p. 

Proof. For fixed sets Sa Q A and Sr Q B each of size Clog(R)/p the probability that Sa and Sr 
span no edge is (1 — p)( c ' lo s(- R )/p) . Therefore, by union bound over all 5,4 and Sr the probability 
that there is is an independent set / in G p , e with \I n A\ = \I n B\ = C\og{R)/p is at most 

(d R R)/ ) - ( 1 ~ p) (Cl ° s[R)/p)2 < m 2Cl °s(R)/P e -p(Ciog(R)/p) 2 

which is at most R~ 3 for large enough C. □ 

Consider the following Gap-Vertex-Cover(c, s) problem where the YES-instances are graphs 
that have a vertex cover of size cn, and NO-instances are all graphs whose minimal vertex cover is 
larger than sn, where n is the number of vertices in G. Note that, equivalently, the YES-instances 
are graphs that contain an independent set of size a(G) > (1 — c)n, the NO-instances are graphs 
whose maximal independent set is of size a(G) < (1 — s)n. 

We remark that the result of Khot and Regev [25] proves that assuming the Unique Games 
Conjecture the problem Gap-Vertex-Cover(l — e, 1/2 + e) if A/’U-hard for all constant e > 0. We 
use this fact in order to hardness of approximation for this problem on percolated instances. 

Theorem 2.8. Let e, 5 € (0,1) be fixed constants. Assuming the Unique Games Conjecture, 
Gap-Vertex-Cover( 1—e, 1/2+e) is MV-hard under a robust reduction with respect to edge percolation 
with parameter p for any p > /_;>■ , where n denotes the number of vertices in the given graph. 

In particular, assuming the Unique Games Conjecture (2 — e)- approximation of the Vertex Cover 
problem is hard on edge percolated instances. 
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Proof. By [25] assuming the Unique Games Conjecture, for any e > 0 the problem Gap-Vertex-Cover (1— 
e, 1/2 + e) is ALP-hard. Equivalently, given an IV-vertex graph G is is VP-hard to distinguish be¬ 
tween the case that a{G) > (1/2 — e)N and the case that a(G) < eN. We show a reduction from 
this problem to itself (with slightly larger parameter e) that is robust for edge percolation. 

Consider the reduction that given a graph G outputs the P-blowup of G, which we denote by H , 
with R > n to be chosen later. That is the graph H is a graph on n = NR vertices, and it is clear 
that a(H) = a{G)-R. Therefore, this is indeed a reduction from the Gap-Vertex-Cover(1— e. 1/2+e) 
to itself. We show below that in fact the reduction is robust for edge percolation. In order to do it 
we prove that with high probability 

a(G) ■ R < a(H) < a(G) -R + (C log (R)/p) ■ N, (1) 

where H = H Pje denotes the edge percolation of H with parameter p. Indeed, the left inequality is 
clear because a(H) > a(H ) = a(G) ■ R, since H is a subgraph of H. 

For the right inequality, by Lemma 2.7 with probability at least (1 — N 2 /R 3 ) the following 
holds: for every edge (u, v ) of G the corresponding clouds u and v in H are such that there is no 
independent set / in H, such that \I n u\ > C\og{R)/p and \I n v\ > Clog(R)/p. Therefore, if 
I is an independent set that intersects some clouds on more than C\og{R)/p, then the vertices 
corresponding the these clouds must form an independent set in G. Thus, with probability at least 
(1 — N 2 /R 3 ) we have a(H) < a(G) ■ R+ (Clog(R)/p) ■ N. 

Next we choose the parameter R such that the reduction above is indeed a robust reduction for 
edge percolation with parameter p. For the parameter p let c = and let R = N 2 / c (where 

N is the number of vertices in the original graph). 

Now, if a(G) > (1/2— s)N, then by (1) we have a(H ) > a(G)-R > (1/2— e)NR = (1/2—e)n, and 
hence H contains a vertex cover of size (1/2 + e)n On the other hand, we claim that if a(G) < eN, 
then with high probability a(H) < 2 en. Indeed, by the choice of R we have p = > 

jv • C lo e(^) _ Therefore, by the right inequality of (1) we have a(H) < a(G) ■ R+(C log(R)/p) ■ N < 

2 a(G) ■ R < 2en, and hence H does not have a vertex cover of size (1 — e)n. This completes the 
proof of Theorem 2.8. □ 

Vertex percolation We proceed with vertex percolation. Note that when considering vertex 
percolation, the percolation parameter p depends on the number of vertices in the given (worst- 
case instance) graph, while the performance of the algorithm is measured with respect to the 
number of vertices in the percolated graph, which is close to pn with high probability 

Theorem 2.9. Let e, 5 € (0,1) be fixed constants. Assuming the Unique Games Conjecture, 
Gap-Vertex-Cover(\ — e, 1/2 + e) is NV-hard under a robust reduction with respect to vertex per¬ 
colation with parameter p, for any p > , where n is the number of vertices in the given graph. 

In particular, assuming the Unique Games Conjecture (2 — e )-approximation of the Vertex Cover 
problem is hard on vertex percolated instances. 

Proof. The reduction is the same as in the proof of Theorem 2.8. For the parameters p and e let 
c = so that p = and let R = (^) 1//c . Given a graph G the reduction produces the 

i?-blowup of G, which we denote by H. That is the graph H is a graph on n = NR vertices. 

Let H = H p e denote the vertex percolation of H with parameter p. By Corollary 1.6, with 
high probability the number of vertices in H = H p ^ e , which we denote by m is between pNR — 
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C^/pNR log (NR) and pNR + C yjpNR log (NR) , and the number of vertices in every cloud of H is 
between pR — C^JpR log TV and pR + C^/pRTogN, for some absolute constant C > 0 independent 
of N or p. 

Clearly any independent set / in H gives rise to an independent set in G by taking all vertices 
v of G such that I intersects the corresponding cloud v. This implies that with high probability it 
holds (for N large enough) that 

a(G) • (pR - C^/pRlogiN)) < a(H) < a{G) ■ {pR + C^/pR\og{N)). 

By the choice of R we have R > C ^ , and hence | ot{H) — a(G)pR\ < £ ■ a{G)pR. Therefore, 

denoting by m the number of vertices in H if a(G) > (1/2 — e)N, then a(H ) > (1/2 — 3 e)m, and 
hence H contains a vertex cover of size (1/2 + 3 e)m. On the other hand, if a(G) < eN, then with 
high probability a(H) < 3 em, and and hence H does not have a vertex cover of size (1 — 3 e)m. □ 

3 Hamiltonicity and Percolation 

Recall that an Hamiltonian cycle in a graph is a cycle that visits every vertex exactly once. Deciding 
if a graph (whether directed or undirected) contains a Hamiltonian cycle is a classical ATP-hard 
problem, which we denote by HamCycle. A hamiltonian path, is a simple path that traverses all 
vertices in the graph. 

In this section we prove unless J\fP = colZV, there is no polynomial time algorithm that given 
a n-vertex directed graph G decides with high probability whether G P)e contains a Hamiltonian 
cycle for any p > —^ where e € (0,1). 

A natural approach in proving that deciding the Hamiltonicity of percolated instances is hard, 
is to “blow up” edges. Namely to replace each edge (u, v) by a clique of size k and connect both 
endpoints of the edges to all vertices of the clique. The idea is that when k is large enough, there is 
a Hamiltonian path with high probability between all pairs of distinct vertices of the clique. Hence 
with high probability, we can connect u and v after percolating the edges, by a path that traverses 
all the vertices of the percolated clique. The problem with this idea, is that the resulting graph after 
this blowup operation may not be Hamiltonian as there is a new set of vertices for every “edge” 
in the original graph that needs to be traversed by an Hamiltonian cycle. For directed graphs, we 
overcome this problem by adding to each vertex v a large clique C, adding a directed edge (v, c) 
for every c£C and and adding a directed edge (c, u) for every c € C and u € N(v) (where N(y ) 
is the set of all vertices having a directed edge from v). 

Theorem 3.1. Let £ € (0,1) be a fixed constant. Then, unless AfV = 1ZV, there is no polynomial 
time algorithm that when given a directed graph G = ( V ., E) with n vertices decides with high 
probability whether G p ^ e contains a Hamiltonian cycle for any p > . 

We will need the following claim. 

Claim 3.2. Let H = (V, E ) be the directed graph with a source s a sink t, and R vertices JJ = 
{ui,...,Ufc}. The edges of H are 

E = {(s —> uf) : i £ [R]} U {(uj —tt):i€. [R]} U {(u* —> Uj) : i,j € [R]}- 

Let H' = (V, E') be an edge percolation of H, where we keep each directed edge with probability 
p = ■ Then, with probability 1 — there is a Hamiltonian path in H from s to t. 
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Proof. Let po E (0,1), and consider the random graph H Po . Note that with probability at least 
1 — 2(1 — po) R there are two distinct vertices v\,vr E U such that (s —» v\),(vr —> t) E E'. 
Conditioning on these specific v\,vr E U, we show that with high probability there is a Hamiltonian 
path from v\ to vr in the subgraph of H po induced by U. 

By a result of [18, Theorem 1.3] if D is a po-edge percolation of the complete directed graph 
with R vertices with po = ' , then with high probability all edges of D are contained in some 

Hamiltonian cycle in D. Note that the probability that H po contains a Hamiltonian path from v\ 
to vr is equal to the probability that H po contains a Hamiltonian cycle that goes through the edge 
(v\ —>• vr ), conditioned on the event that (iq —>■ vr) E E'. Therefore, since the distribution of 
the subgraph of H Po induced by U is distributed like D, it follows that with high probability the 
subgraph H Po induced by U contains a Hamiltonian path from v\ to vr, and hence H Po contains a 
Hamiltonian path from s to t with probability at least 1/2. 

Next, let t = 31og(.R), and let p = t ■ po- We claim that the graph H p contains an Hamiltonian 
path from s to t with probability at least jp. Observe that if H]...., H[ are independent copies of 
H Po , then the probability that none of the H[ contains a Hamiltonian path from s to t is at most 
(1/2)* < jp. Therefore, since each edge of H is contained in U* = 1 iT; independently with probability 
1 — (1 — poY < p it follows that H p contains an Hamiltonian path from s to t with probability at 
least -^ 3 -, as required. □ 

Proof of Theorem 3.1. In order to prove the theorem, we show a reduction that given a directed 
graph G = (V,E) produces a directed graph G' = (V',E') such that 

• If G contains a Hamiltonian cycle, then G' contains a Hamiltonian cycle, and with high 
probability G' p e contains a Hamiltonian cycle. 

• If G does not contain a Hamiltonian cycle, then neither does G', and hence G pe does not 
contain a Hamiltonian cycle. 

The reduction works as follows. Let V = [N] be the vertices of G, and let R be a parameter 
to be chosen later. The vertices of G' will be V' = H(J(U^ 1 C/j), where Ui = {u\,... ,u l R }. For 
each i E [N] the graph G' contains all edges in both directions inside Ui. For each directed edge 
(i —>• j) E E we add in G' the directed edges 

{(* —> u\) : i E [-R]} U {(i4 > j) '■ £ £ [-R]}- 

That is, we turn the graph G into G' by adding a clique Ui for each vertex Vi E V, and letting all 
edges outgoing from Vi go through this clique. This completes the description of the reduction. 

Let us first show that that G contains a Hamiltonian cycle if and only if G' contains a 
Hamiltonian cycle. Indeed, suppose that C = (< 7 i,..., cr/v) is a Hamiltonian cycle in G. Then 
C' = (< 7 i, uf 1 ..., ufl ,..., ajsr,Ui N • • •, uff 1 ) is a Hamiltonian cycle in G'. In the other direction, 
suppose that G' contains a Hamiltonian cycle C'. It is easy to see that any iEk appearing in C' 
must be followed immediately by a permutation of all R vertices in U t . Therefore, by restricting 
C' to the vertices in V we get a Hamiltonian cycle in G. 

Next we show that the reduction above is robust to edge percolation. Let G' = G' p ^ e be the 
edge percolation of G'. Clearly if G' does not contain a Hamiltonian cycle, then neither does 
G'. Therefore, it is only left to show that if G' contain a Hamiltonian cycle C, then with high 
probability G' also contains a Hamiltonian cycle. As explained above a Hamiltonian cycle in G' 
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is given by a permutation a = (oi,... <tjv) G Sjv and an arbitrary ordering of the vertices in 
each Ui, i.e., C' = (cti, u^ 1 ..., u R ,..., a n, u![ n ..., u^*). Note that for each i € [N] the vertices 
{cij, Ui* ..., u° } j, induce a subgraph isomorphic to the graph H from Claim 3.2. Therefore, by 

Claim 3.2 if p > log fi ^ , then for each i € [N] with probability 1 — there is path from to cr t +i 
that visits all vertices in U ai . By taking union bound over all i € [N] we get that with probability 
1 — jp such paths exist for all i € [IV], and by concatenating them we conclude that G' contains a 
Hamiltonian cycle with high probability. 

Finally, we specify the choice of the parameter R. The obtained graph H has n = NR vertices, 
and the constraints we have are p > log R R and R 3 3> N. Therefore, in order to prove the theorem 
for p > with e € (0,1) it is enough to take R = N 1 / c , where c = > £ such that 


4 Constraint Satisfaction Problem and Percolation 

In this section we deal with percolation on Constraint Satisfaction Problems (CSP). An instance 
of Boolean k-CSP is a formula consisting of a collection of clauses Ci,...,C m over n Boolean 
variables x \,..., x n , where each clause is associated with some A:-ary predicate / : {0, l} fc —> {0, 1 } 
over k variables ,..., Xi k . An instance ‘h is said to be simple of all clauses in <P are distinct. Given 
an assignment a : {xi, ...,x n } —>• { 0 , 1 } we say that the constraint C on the variables x ^,... ,Xi k is 
a satished by a if fc(o'(xi 1 ,...,a(xi k )) = 1, where fc is the predicate corresponding to C. Given 
a formula <h, and an assignment o to its variables the value of $ with respect to the assignment 
a, denoted by val a (<&), is fraction of constraints of <1? satished by a. The value of is defined as 
val{<&) = max CT If val(&) = 1 we say that is satishable. 

We are typically interested in CSP where constraints belong to some fixed family of predicates J~. 
For example, in the A:-SAT problem, the constraints are all of the form f(z \,..., Zk) = Vi=i i z i = h), 
for b\, ..., bk € {0,1}. We assume that k, the arity of the constraints, is some fixed constant that 
does not depend on the number of variables n. 

These definitions give rise to the following optimization problem. Given a CSP instance $ find an 
assignment that maximizes the value of <h. We refer to this maximization problem as Max-CSP-J 7 , 
where J- denotes the family of predicates constraints are taken from. For 0 < s < c < 1, let 
Gap-CSP-J r (c, s) be the promise problem whose YES-instances are formulas such that val($) > c, 
and NO-instances are formulas T such that val{<&) < s. Here we assume the constraints of CSP 
instances are restricted to be in some family T. 

We study two models of percolation on instances of CSP. In clause percolation given an instance 
$ of CSP its clause percolation is a random formula d>p over the same set of variables, that is 
obtained from T by keeping each clause of <!> independently with probability p. 

In variable percolation given an instance <!> of CSP the variable percolation is a random formula 
whose set of variables is a subset S of the variables of <h, where each variable of is in S 
independently with probability p € (0,1) and the clauses of are all clauses of d> induced by S. 
In other words, a clause C of $ survives if and only if all variables from C the percolation process. 

Clause percolation In this section we show that for Constraint Satisfaction Problem with a k -ary 
constraints, the problem of approximating the optimal value on percolated instances is essentially 
as hard as approximating it on a worst-case instance as long as p > k -i-s f° r an Y constant 5 > 0 . 
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Theorem 4.1. Let e,d € (0,1) be fixed constants. There is a polynomial time reduction such that 
given a simple unweighted instance 4* outputs a simple unweighted instance 'I' on N variables with 
the same constraints, such that val(^>) = val(&), and furthermore for any p > k-i-s the following 
holds. 

1. If val(f&) = 1, then val(At p ) = 1 with probability 1. 

2. If val($>) < 1, then with high probability \val{^> p ) — val(&)\ < e. 

Theorem 4.1 immediately implies the following corollary. 

Corollary 4.2. Let T be a collection of Boolean constraints of arity k, and suppose that for some 
0 < s < c< 1 the problem Gap-CSP-F{c, s) is NV-hard. Then Gap-CSP-J r (c—e, s+e) is NV-hard 
under a robust reduction with respect to clause percolation with any parameter p > k } 1 _s , where 

n denotes the number of variables in a given formula, and e, 6 > 0 are arbitrary constants. 

As a particular application, we have the following result regarding the hardness of approximating 
3-SAT on clause-percolated instances. 

Theorem 4.3. Let e, 6 G (0,1) be fixed constants. Then, unless J\fV C colZV, there is no polyno¬ 
mial time algorithm that when given a satisfiable instance 4> of 3-SAT finds an assignment a to & p 
such that val a (<&p) > 7/8 + e for all p > 

Proof. By the result of Hastad [22] for any constant e > 0 given a weighted 3-SAT instance 
4> it is A/'T 7 -hard to distinguish between the case that that cf is satisfiable, and the case that 
val((f >) < 7/8 + e. Combining this result with the result of [12] we get that the same problem is 
AAP-hard also for unweighted simple instances. The proof follows by applying Theorem 4.1. □ 

We now return to the proof of Theorem 4.1. 

Proof of Theorem f.l. We start with the following lemma. 

Lemma 4.4. Let 4> be a simple unweighted a k-CSP instance with n variables and m clauses, and 
let p > for some e € (0,1) and some absolute constant C > 0. Then, 

1. If val($>) = 1, then val(^ p ) = 1. 

2. val($) < 1, with high probability \val(^ p ) — val(&)\ < e. 

Proof. The first item is clear, as any assignment that satisfies 4> will also satisfy 4>/. For the second 
item, denote by m! the number of clauses in 4>/. By concentration bounds we have 

Pr[\m'-pm\ > epm] < e ~^ £2pm) < e ~^ Cn \ 

where fl(-) hides some absolute constant. Fix an assignment u to the variables of <3?, and let 
s = ua/ cr (4>). Then, the number of clauses in 4> satisfied by a is sm. Denote by S a the number 
clauses in <& p satisfied by a. Since we pick each clause with probability p independently, we have 

Pr[\S a — spm\ > epm} < e~ n< ' £2pm ^ < e~^ Cn \ 
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and hence 


Pr[| val a (&p) — s| > e] = PrfIS'o- — sm'\ > em'] 

< Pr[|m/ — pm\ > epm/2] + Pr[|5 CT — spm\ > epm/2] 

< 2e~ u{ - Cn \ 

where fl(-) hides some absolute constant. 

Suppose now that that ua/(<3>) = s. If a is an optimal assignment to <3>, i.e., val a (f&) = s, then 
we immediately have by the argument above that val a (^//) > s — e with high probability. On the 
other hand, for any assignment a' it holds that Pr[uaC'(‘hp) > s + e] < e~ n ( Cn ') for some sufficiently 
large C > 0, and by taking union bound over all assignments a we get 

Pr [val($p) > s + e] < Pr[3cr / such that val a '(<& c p ) > s + e] < c n 
for some absolute constant c < 1. □ 

We note that we assume in the proof above that s is a constant independent of n. This 
assumption is justified as s > and we assume that k is independent of n. 

Next, we show a polynomial time reduction such that given a Max-CSP-J 7 instance <3> outputs 
a Max-CSP-J 7 instance T with N variables and N k ~ e clauses such that na/(T) = val(&). We use 
similar ideas to those used in [12] in proving that unweighted instances of CSP problems are as 
hard to approximate as in the weighted case. 

Lemma 4.5. For any (5 £ (0,1) there is a polynomial time reduction such that given a simple un¬ 
weighted Max-CSP-J- instance outputs a simple Max-CSP-T instance T with the same constraint 
with n variables and at least n k ~ s clauses such that val(^f) = val(&). 

Proof. The reduction works as follows. Let R be a parameter to be chosen later. Given an instance 
of fc-CSP with M clauses over the variables x \,..., Xn the reduction creates the following instance 
\k. For each variable x% of <3?, the instance T will have a set of R corresponding variables Xj = 
{ x i,j '■ j £ [-R]}, where we think of each variable in Xi as a copy of x*. For each clause C of <1? we 
add to $ a cloud of R k corresponding clauses, by taking all possible combinations of the variables 
from the corresponding Xfs. We call the set of R k clauses corresponding to C the cloud of C. 
That is, T has n = NR variables and m = M ■ R k clauses. Therefore, if R > N k/S , then m > n k s . 

Next we claim that val{<&) = val(^>). Clearly, we have val(&) < na/(T), as any assignment 
a : {x\,... ,xn} € {0,1} to <3> can be extended to the assignment r to T by letting r(xij) = a(xi ) 
for all i € [N ], j £ [i?]. 

In the other direction, let t be an assignment to the variables of T. 1 For each i € [N] let 
p) = 1 ^' be the fraction of x t j's that are assigned the value 1, and let p ( - = 1 — p\. 

be the fraction of Xjj’s that are assigned the value 0. Construct a assignment a to the variables 
of <3> randomly, by setting a(xi) = 1 with probability p \, and setting a(xi) = 0 with probability 
p { - independently. Equivalently we choose one of the R copies of Xi in T uniformly at random and 
assign to Xi the value assigned by r to the variable chosen. A moment of thought reveals that for 
each clause C of <3>, the probability that a satisfies C is equal to the fraction of the clauses in \It in 
the cloud corresponding to C that are satisfied by r. Denote by SAT a (Ci) the number of clauses 

1 Note that if for each i £ [JV] the assignment r gave the same value to all variables in Xi, this would naturally 
induce a corresponding assignment to < f > . However, this need not be the case in general. 
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that are satisfied by a in the cloud corresponding to Q. Since each clause of <f> corresponds to the 
same number of clauses in 'F, it follows that the expected value of 4> under the assignment a is 

1 M 

E[ua/ (T ( < 1>)] = — Pr[fj satisfies Cj\ 

i =1 

1 y, SAT a (Ci) 

M ^ R 3 

i =1 

= val T (fi!). 

Hence, there exists an assignment o to the variables of <F such that val a (&) > val T ( T), and thus 
val{<&) > val( x F), as required. □ 

Theorem 4.1 follows immediately from Lemmas 4.4 and 4.5. 

□ 

We observe that it is unlikely that Lemma 4.5 could be generalized to Max-CSP-J 7 instances 
with-arity k and f l(n k ) constraints. For example, the value of a 3-SAT formula with 0(n 3 ) clauses, 
admits 1 — 5 approximation for every 5 € (0,1) in polynomial time [5]. 

Variable percolation Next we show that Max-CSP-J 7 is also hard under variable percolation. 
We prove below that for p that is no too small, with high probability Max-CSP-J 7 is hard to 
approximate on percolated instances within the same factor as in the worst-case setting. 

Theorem 4.6. Let e,5 > 0 be fixed constants. There is a polynomial time reduction such that 
given a simple unweighted instance <F outputs a simple unweighted instance \F on n variables with 
the same constraints, such that val(fi>) = val($>), and furthermore for any p > ^4^ the following 
holds. 

1. If val($>) = 1, then val(^> p ) = 1 with probability 1. 

2. If val(3>) < 1, then with high probability \val{fi! v p ) — val(<&)\ < e. 

The following corollary is the analogue of Corollary 4.2 for variable percolation. 

Corollary 4.7. Let T be a collection of Boolean constraints of arity k, and suppose that for some 
0 < s < c < 1 the problem Gap-CSP-R^c, s) is NV-hard. Then Gap-CSP-IF(c—e, s+e) is NT-hard 
under a robust reduction with respect to vertex percolation with any parameter p > , where n 

denotes the number of variables in a given formula, and s, 5 > 0 are arbitrary constants. 

Proof of Theorem f.6. The reduction is the same reduction as in the proof of Theorem 4.1. Namely, 
given a simple unweighted instance with N variables and M clauses the reduction replaces each 
variable Xi of <h, with a set of R corresponding variables X,; = {xij : j € [-R]}, and replaces each 
clause of <h with a cloud of R k corresponding clauses, by taking all possible combinations of the 
variables from the corresponding Xfs. That is, the output of the reduction T has n = NR variables 

and m = M ■ R k clauses. We choose R = N 1 / c , where c = £ ( 5 ,1) so that \J 

For each i € [N] let X[ be variables from Xj that remain in after variable percolation. By 
concentration, it follows that forp > with high probability ||Xl|— pR\ < 0(yJpR log(n)) for all 
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i € [N], We assume from now on that this is indeed the case. For a constraint Ci of d> let xq,..., Xi k 
be the variables that participate in Q. Then, the number of clauses in the cloud corresponding to 
Ci in dp is equal to \X[ i j • • • \X[ k |, and the total number of clauses in dp is l-^ql ''' I XjJ. 

By Lemma 4.5 we have val{ d/) = val(&). In particular, if d> is satisfiable, then so if d/, as 
any assignment that satisfies T also satisfies any sub formula of dq which implies that dp is also 
satisfiable with probability 1. 

Suppose now that val(&) < 1. We claim that with high probability |uaZ(dp) — val(Q)\ < e. 

To prove that uapdp) > ua/(d>) — e, let a be an optimal assignment to d>. Extend a to an 
assignment r to dp by letting r(xjj) = a(xi ) for all 1 < i < R. Note that for each constraint 
Ci of d> if Ci is satisfied by cr, then in dp all clauses in the corresponding cloud are satisfied, and 
otherwise no clause in the corresponding cloud is satisfied. Denoting by SAT r (Ci ) the number of 
clauses that are satisfied by r in the cloud corresponding to Ci we have 




E m W' \ I vf I 
i =l l A u r ’ ’ l A zJ 


> > vaim _ 0( 


M{ P R + \JpR log(N)) k 
By the choice of R we get for large enough N 


'log(AO- 
pR 


val T (^ v p ) > val(<S>) - > val($) - e. 


Next, we prove that ua/(d?) > uai(dp) — e. Given an assignment r to the variables of dp we 
decode it into an assignment to d> using the same decoding as in the proof of Lemma 4.5. Namely, 
we choose a random assignment a to the variables of d> by setting a(xi) = 1 with probability pj 
and cr(xj) = 0 with probability pd independently between V s, where pj = ^ anc | 

p® = 1 — pj. Let C[ be the set of clauses in C, that belong to dp. Let SAT r (C[ ) the number of 
clauses that are satisfied by r in Cj. it follows that the expected value of d> under the assignment 
a is 


eK($)] 


— ^2 P r [<7 satishes Cj 
1=1 


1 y SAT r (Cj) 
M 4^ IX' I ••• IX' 

i =1 ' ^1 ' ' 


( 2 ) 


On the other hand we have 


val T ( dp) 


YZiSAT T {Cj) _ 

E m i vi i iv/ i' 

i =1 l A ql ' ‘ ‘ I A iJ 


(3) 


Now, using the assumption that for all i € [n] it holds that \\Xj\ — pR\ < yJpR log(n), we get that 


both (2) and (3) are between 


E& sat t (c') 


and 


Efa/ SAT t (C<) 


A simple computation 


M(pR+y/pRlog(N)) k M(pR—y/pR\og(N)) k ' 

reveals that the difference between the two quantities is at most and hence 


E[uaG( < I > )] > val T ( dp) - OU 


Aog(N) 

pR 


) > val T {%) - O(^) > val r {%) - e 


This completes the proof of Theorem 4.6. 


□ 
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5 The Subset Sum Problem and Percolation 


In this section we consider the subset-sum problem, and its percolated version. In the subset-sum 
problem we are given a set items {o,i}f =l which are positive integers, and a target integer S. The 
goal is to decide whether there is a subset of afs whose sum is S. 

Given an instance / = ({«*}?= i> &) of the subset sum problem, we define a percolation on I with 
probability p to be a random instance I p , where each item ai is included in I p with probability p 
independently, with the target of I p being the same as the target of I. 

It is known that subset sum is A/T-hard. Below we prove hardness of the percolated version of 
the subset sum problem. 

Theorem 5.1. Rhe Subset-Sum problem is MV-hard under robust reduction with respect to per¬ 
colation with parameter p for any p > if 2 -e > where n is the number of items in a given instance, 
and e > 0 is any fixed constant. 

Proof. In order to prove the theorem, we show a reduction that given an instance I = ({a*}^; S) 
of the subset-sum problem with all a* > 0, produces an instance I' on n variables such that the 
following two properties are satisfied. 

• If I G Subset-Sum, then /' € Subset-Sum, and furthermore, with high probability I p G 
Subset-Sum. 

• If / ^ Subset-Sum, then I' ^ Subset-Sum, and hence I p ^ Subset-Sum with probability 1. 

Let us assume that the number of items in I is even. (If N is odd, then, add an item to / that is 
equal to zero). Let R be a parameter to be chosen later, let N' = [dog 2 QG,; Oj)], and for i = 1,..., n 
let Mi = 2 C ( N for a large enough constant C' . For each i € [N] define the following set 

Ji = {Mi + m ■ N 3 + k : k € {—R, ■ ■ ■, R}} and J[ = {Mi + k : k € {—R, ■ ■ ■, R}} 

Consider now the instance 

where S' = S ■ N 3 + Mi. Clearly this is a polynomial reduction that outputs a Subset-Sum 
instance with n = 2NR items. 

We show first that I G Subset-Sum if and only if I' G Subset-Sum. Indeed, suppose that for 
some subset T C [N] it holds that ^ieT a * = Consider the following subset of items of I'. For 
each i G T take the item from Jj that corresponds to k = 0, and for i G [N] \ T take the item from 
J'j that corresponds to k = 0. Then, by taking these items we are getting 

• n3 + M i) + 2 Mi = S'. 

i£T ie[N]\T 

In the other direction, suppose that I' G N. Then, there is some subset T' C [N] x {0,1} x 
{—R,... ,R} such that 


(Mi + ai- N 3 -t + k) = S' = S ■ N 3 + Y M i- 

(■ i,t,k)eT ' ie[JV] 
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Note that by the choices of Mi (namely because Mi s are much larger than afs and R ) for each 
i £ [N] there is a unique ti £ {0,1} and a unique ki £ {—R,...,R} such that ( i,ti,ki ) £ T'. 
Therefore, since YliLi\ki\ < IV.D < IV 3 , it follows that X^ie[7V] ki = 0, and hence by defining 
T = {i £ [IV] : ti = 1} we get that YlieT a * = and so I £ Subset-Sum. 

Next, we claim that the reduction above is in fact robust. Indeed, consider the percolated 
instance I' p for some p £ (0,1]. Note that if J ^ Subset-Sum, then neither is I', and hence 
Ip (f Subset-Sum with probability 1. It remains to show that if I £ Subset-Sum, then with high 
probability I' £ Subset-Sum. The proof relies on the following claim. 

Claim 5.2. Let N £ N be even, and let R £ N. Let Hi,..., A n C {—R ,..., R} be random sets 
chosen by letting each k £ {—R, ■ ■ ■, R} to be in Ai with probability p independently of each other. 
Then, with probability > 1 — N/2 ■ (1 — p 2 ) 2R for each i £ [n] there is ki £ Ai such that YliLi ki = 0- 

Proof. Note that for each odd i £ [N], the probability for a fixed element x € {—R, ■ ■ ■ ,R} that 
both x £ Ai and — x € ^4j+i hold is p 2 . Therefore, 


Pr[3&; € {— R,... ,R} : k £ Ai and — k £ vl*+i] = 1 — (1 — p 2 ) 2R+1 . 


Hence, by taking the union bound over all pairs (i, i + 1) with odd values of i we get that with 
probability at least 1 — N/2 ■ (1 — p 2 ) 2R , for all odd V s there is ki £ Ai such that — ki £ Ai + \. □ 

Suppose now that I £ Subset-Sum, i.e., for some subset T C [ N ] it holds that J2ieT a i = &• 
Note that the percolated instance I' p is obtained from I' by taking random subsets of Jj and J[ 
independently of each other. For i £ [IV] define Ai to be the p-percolated subsets of J* if i £ T , and 
define Aj to be the p-percolated subsets of J[ if i (f T. Note that if R > then the conclusion 

of Claim 5.2 holds with probability at least 1 — 1/N. Therefore, in the percolated instance I' p by 
taking the items of I' p from Hj’s that correspond to k £ Af s from Claim 5.2 we get 


+ ai ■ N 3 + ki) + (Mi + ki) 

ieT ie[N]\T 


( X/ + ai ' + ( X/ ^i) 

is [AT] i&T ie[JV] 

( ^ [ Mi) + 5 + 0 

ie[N] 

S'. 


Therefore, with high probability I' p £ subset-sum as required. 

Finally, note that the reduction works as long as R > C , or equivalently p > C^J l ° g R N ' . It 
is easy to verify that for R = N ly/c , with c = , the foregoing reduction is indeed a robust 

reduction with respect to percolation with parameter p > i/2- e f° r an y constant e > 0, where n is 

the number of items in the Subset-Sum instance. □ 


6 Conclusion 

We have examined the complexity of percolated instances of several well known A/'T’-hard problems 
and established the hardness of solving exactly and approximately these problems on such instances. 
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It might be of interest to study percolated instances of other ATP-hard problems that were not 
considered here. 

There are several question arising from this work. For the HamCycle problem it would be 
interesting to determine whether vertex-percolated instances are hard (in directed or undirected 
graphs). Currently, we are unable to establish that this problem is hard even if every vertex remains 
with probability p < 1, where p is a constant that does not depend on the size of the graph. It could 
be the case that there is no reduction from AfV to HamCycle that is robust to vertex percolation. 
Proving the inexistence of such reductions (if true) could be of interest. 

It might also prove worthwhile to determine whether percolated instances of 3-SAT remain 
hard to solve even if p = 0(l/n 2 ) over n- variable formulas. Several works suggest that finding a 
satisfying assignment for a random 3-SAT instance with n variables and C\n clauses is hard when 
C\ is close to the satisfiability threshold [30]. Other works suggest that certifying the unsatisfiability 
of a random 3-SAT instance with C 2 n clauses, with C 2 being large enough, is difficult as well [15]. 
These works may serve as evidence that 3-SAT should be hard to solve for p = 0(l/n 2 ). 

One of the first algorithms for solving independent sets in the random graph G(n,p ) is the Karp- 
Sipser algorithm [24], This algorithm works by choosing iteratively a degree one vertex randomly, 
adding it to the independent set, and removing both the selected vertex and its sole neighbor from 
the graph. When there are no vertices of degree one, the algorithm terminates. It was proven in [4] 
that when p < ^ (where e is the base of the natural logarithm, e = 2.71828...) the algorithm finds 
with high probability an optimal independent set. When p > the Karp-Sipser algorithm fails 
(with high probability) to find a maximum independent set of the graph [24]. Despite extensive 
research, no algorithm is known to find an optimal independent set in G(n,p ) when p > 

It is not clear whether the Karp-Sipser algorithm works on random subgraphs of worst-case 
graphs, as opposed to a random subgraph of the complete graph. This leads to the following 
problem. 

Problem 6.1. Let p < Is there a polynomial-time algorithm that given an n-vertex graph G 
finds a maximal independent set in G Pie with high probability? 

Is it true that for p > ^ no algorithm can find a maximal independent set in G P)e for worst case 
instance G, unless AfV C BVV ? 
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